Development and Evaluation of BenchBalance: A System for Benchmarking Balance Capabilities of Wearable Robots and Their Users

Recent advances in the control of overground exoskeletons are being centered on improving balance support and decreasing the reliance on crutches. However, appropriate methods to quantify the stability of these exoskeletons (and their users) are still under development. A reliable and reproducible balance assessment is critical to enrich exoskeletons’ performance and their interaction with humans. In this work, we present the BenchBalance system, which is a benchmarking solution to conduct reproducible balance assessments of exoskeletons and their users. Integrating two key elements, i.e., a hand-held perturbator and a smart garment, BenchBalance is a portable and low-cost system that provides a quantitative assessment related to the reaction and capacity of wearable exoskeletons and their users to respond to controlled external perturbations. A software interface is used to guide the experimenter throughout a predefined protocol of measurable perturbations, taking into account antero-posterior and mediolateral responses. In total, the protocol is composed of sixteen perturbation conditions, which vary in magnitude and location while still controlling their orientation. The data acquired by the interface are classified and saved for a subsequent analysis based on synthetic metrics. In this paper, we present a proof of principle of the BenchBalance system with a healthy user in two scenarios: subject not wearing and subject wearing the H2 lower-limb exoskeleton. After a brief training period, the experimenter was able to provide the manual perturbations of the protocol in a consistent and reproducible way. The balance metrics defined within the BenchBalance framework were able to detect differences in performance depending on the perturbation magnitude, location, and the presence or not of the exoskeleton. The BenchBalance system will be integrated at EUROBENCH facilities to benchmark the balance capabilities of wearable exoskeletons and their users.


Introduction
Wearable robotic devices such as lower-limb exoskeletons have attracted extensive interest in the last decades, demonstrating the ability to support people with motor impairments in standing and walking. Recent research on these devices is moving toward the development of controllers to assist balance during standing and/or walking. These controllers may involve different degrees of freedom (DOFs) to improve exoskeleton-user stability and may vary in complexity, using (1) control strategies derived from humanoids applications [1][2][3][4], (2) bio-inspired approaches [5][6][7], or (3) simple heuristic methods [8][9][10]. However, appropriate procedures and metrics to properly quantify, compare, and assess

BenchBalance System
For an accurate balance assessment and to be able to derive quantitative metrics of the balance capabilities of a human wearing (or not) an exoskeleton, we focused on the following requirements: • Quantify the disturbance applied in terms of force magnitude and orientation; • Quantify where a perturbation is applied on the human body, since balance recovery strategies might differ depending on the location of such perturbation; • Ensure an appropriate synchronization of the perturbation with the user's response; • Provide real-time feedback to the experimenter to augment the ability of providing perturbations in a consistent way; • Calculate outcome indicators to quantify the balance response by using kinematic data collected either with any motion capture system (Mocap) or with the on-board exoskeleton sensors.
Based on these requirements, the BenchBalance testbed ( Figure 1) is composed of the following: (1) a portable hand-held perturbator equipped with force and orientation sensors, which is used to provide and quantify well-defined pushes to the human upper body during both standing and walking conditions; (2) a position system detector (smart garment), which is used to determine the location of the generated perturbation in relation to the human wearing the exoskeleton; and (3) an analysis unit that combines the information of the provided force (magnitude, direction, and location) with joint angle data from the wearable exoskeleton or Mocap system and generates the Balance Indicators (BIs), i.e., metrics to complete the assessment.

Perturbator
A battery-driven manual perturbator (Figure 2a) was designed to provide and quantify the external disturbances to the user. The frame of the perturbator was made using PLA 3D printing material, which was reinforced with metal in the handles to increase its robustness. The maximum length of the perturbator is 412 mm with a diameter at the contact end of 90 mm. With a total weight (including sensors and electronics) of 1.2 kg, the perturbator can be easily handled by the operator, so the versatility of the device is very high in terms of usability and the ability to apply a wide range of perturbations.
The housing includes a command panel with a graphic display that indicates to the experimenter the most relevant real-time information on the perturbations, an USB connector to charge the battery, and a push button for on/off switching.

Electronics and Signal Processing
The perturbator is equipped with two main sensors used to measure the magnitude and the orientation of the perturbation: • A three-axial force sensor, K3D60a (ME Systeme GmbH, Hennigsdorf, Germany). The nominal force of the sensor is 500 N with an accuracy of 0.2% in all directions, which was a requirement to properly quantify the magnitude of the perturbation applied to the user. • An inertial measurement unit (IMU) MTi-3 AHRS (Xsens, Enschede, The Netherlands). Together with another inertial sensor integrated in the smart garment subsystem, it is used to estimate the relative orientation of the perturbation with respect to the human.
In addition to the previous main sensors, the electronics of the perturbator (Figure 2b) also include the following: • Amplifier with ADC 24 bits (HX711), used to measure the force detected by the force sensor. • LCD display, New Haven, NHD-2.8-25664UMY3, included in the command panel to show relevant real-time information to the experimenter, i.e., perturbation amplitude, impulse, and orientation. • Micro SD card, used to pre-store the data of the experiment that will be later transferred to the host computer (sampling frequency of data collected: 30 Hz). • Battery Management System (BMS), to control a one-cell lithium polymer (LiPo) battery (3.7 V, 2000 mAh) used to supply the microprocessor board and the sensors.
A microprocessor board (Teensy 3.6, ARM Cortex-M4 at 180 MHz, PJRC, Sherwood, United States) was used for the communication among all the components of the perturbator (Figure 2b). One thread of the processor was dedicated for Bluetooth transmission, and the second thread was used for the algorithms of the rest of the electronics. The firmware running in the microprocessor is written in C language. The internal communication between the different electronic components was performed by using both serial peripheral interface-SPI (for the LCD display), and inter-integrated circuit interface-I2C (for the force and IMU sensors and the BMS). Both protocols are synchronous, so the outputs of bits were synchronized to the sampling of bits by a clock signal shared between the master (Teensy 3.6) and the slaves.
To calibrate the 3D force sensor, an algorithm was developed, which is being executed every time that a protocol with BenchBalance is performed. During the calibration, 2000 data samples of the three axes are used to calculate the corresponding mean values per axis. The calibration of the IMU is performed similarly, deriving as output signals the quaternions and the Euler angles to indicate the orientation to the experimenter.
To write the raw data in the microSD, we use the SD protocol (SDIO). A Real-Time Clock (RTC) was used for inter-tasks communication, which is automatically calibrated when being connected to the host computer. Bluetooth 3.0 was selected to transfer the collected data in real time to the host computer (30 Hz). The decision was made based on internal requirements to ensure sufficient bandwidth and synchronization.

Smart Garment
A textile vest instrumented with polymeric optical fibers (POFs) (Figure 3) was designed to detect the area of application of the perturbation in the user wearing the exoskeleton. It offers a robust, minimally invasive, and multiplexed solution to monitor the pressure of the perturbation alongside the human upper body. The smart garment was manufactured in three different sizes, ensuring a broad application and accommodating the size difference between users. The textile vest is equipped with two embedded POFs made of polymethyl-methacrylate (PMMA-HFBR-EUS100Z, Broadcom, San Jose, United States). Each POF has a circular cross-section with three layers, being from inside to outside: the core, the cladding, and the buffer coating. The total diameter for our POF, including its polyethylene coating, is 2.2 mm, being the diameter of the core equal to 980 µm, and the thickness of the fluorinated polymer cladding equal to 20 µm.
Thirty sensors are distributed along the POFs, covering the front and back area of the trunk, the area below the arms, and the shoulders (Figure 3a). Each sensor has an independent source light (i.e., a light-emitting diode, LED), and each end of the POF has a photodetector (four in total) located in the electronic box placed on the back part of the garment. To increase the sensitivity of the sensors and to enable the side coupling of the light source, we followed the guidelines presented in [21].
The measuring principle for these sensors is based on light intensity variation, which is done by converting the light intensity provided by the LED into voltage. The multiplexing technique proposed by Leal-Junior et al. [22] was used to decouple the sensor's individual response and thereby enable knowing in which area of the smart garment the perturbation was being provided. This technique is based on the sequential activation of each LED with a predefined frequency and activation sequence.

Electronics and Signal Processing
The main sensors that compose the smart garment are the above-mentioned thirty POF sensors, together with an IMU sensor (Xsens-MTi-3 AHRS), which is integrated in the textile and has the same characteristics than the one of the portable perturbator. This IMU is used to measure the orientation of the human body. Thereby, by using both the IMU of the perturbator and the one of the smart garment, we can derive the relative orientation of the provided perturbation with respect to the human.
For the communication of the electronic components of the smart garment, we used a microprocessor board (NXP FRDM-KL25Z, Figure 3b). Similarly to what was done for the perturbator, the first thread of the microprocessor was dedicated to Bluetooth transmission and the second thread dedicated to the communication of the rest of electronics, including the following: • Four high-sensitive phototransistor detectors (Industrial Fiberoptics, IF-D92) to transform the light intensities into voltages. • Micro SD card module to pre-store the data collected during the experiment that later will be transferred to the host computer (sampling frequency of data collected: 30 Hz). • BMS (SparkFun Battery Babysitter) to monitor a LiPo battery (3.7 V, 2000 mAh), which is used to supply the microprocessor board and the sensors.
As done for the perturbator, the communication between the different electronic components of the smart garment was performed via SPI and I2C interfaces. Concretely, the Micro SD card was connected in this case via the SPI bus and the BMS via I2C bus. The IMU sensor was connected to the KL25Z via UART. The phototransistor detectors (IF-D92) were coupled to the ADC (16 bits) ports of the microprocessor.
To calibrate the thirty POF sensors, 200 data samples are taken from each of them and used to set threshold values to indicate the zero activation. A sensor is being recognized as active when its threshold value is being overpassed by pressing the sensor. During the use of BenchBalance, the four most active sensors within each data sample are being sent in a pack of data to the host computer.
For the IMU of the smart garment, we follow the same calibration procedure than with the IMU of the perturbator, calculating as well the quaternions and Euler angles. The global coordinate frames for both IMUs are also obtained during the calibration of the whole BenchBalance system.
The raw data of the smart garment are transferred in real time using Bluetooth 3.0 to the host computer and synchronized with the raw data of the perturbator by timestamps.

User Control Interface
To facilitate the use of the BenchBalance system, we developed an acquisition software interface (Figure 4), which acquires the data of the perturbator and the smart garment and guides the experimenter throughout the required steps to deliver reproducible perturbations to the human body. This interface is programmed in JavaScript-HTML, and it includes all the requirements to follow a pre-defined protocol of perturbations to assess the user's balance in both antero-posterior (AP) and mediolateral (ML) directions.
Before starting the measurements, it is required to calibrate the system to remove offset values and define the coordinate frames. This is done through the interface, which indicates to the experimenter how to put the perturbator and smart garment in the calibration position and for how long until the offset values have been removed. After calibrating the system, the experimenter should insert the anthropometric properties (height, total mass) of the test subject and indicate the presence or not of the exoskeleton, including its mass. These properties are used by the interface to calculate the tolerances of the perturbation magnitudes to be applied in the pre-defined protocol. The data recorded during the protocol are classified and saved at a host computer. It allows the posterior comparison between different recordings performances in a standardized way.

Controlled Variables and Protocol
Controlled variables (CVs) refer to parameters and features that are monitored by the acquisition interface to guarantee homogeneous assessment conditions. To account for different kinds of perturbations (magnitude, orientation, duration, and location) in both AP and ML directions, we defined a protocol with sixteen conditions for the standing balance (see Table 1). The CVs evaluated within this protocol include the following: • Perturbation magnitude: It is the maximum amplitude of the force (N) applied by the experimenter to the subject by means of the perturbator. In particular, the perturbator monitors the forces along all three axes, and we calculate the magnitude of the resultant vector. Within the BenchBalance protocol (Table 1), we consider two levels of perturbation magnitude: "Low", computed as 8 ± 2% of the total mass (body mass plus mass of the exoskeleton, if present); and "High", computed as 16 ± 2% of the total mass ( Figure 5). • Perturbation duration: It is the time interval along which the force is applied by the experimenter by means of the perturbator. It is calculated as the elapsed time of the force exceeding a ("no force") threshold of 5 N, and it is expressed in seconds ( Figure 5). For the BenchBalance protocol, an acceptable value of the perturbation duration is 0.35 ± 0.15 s for all conditions. • Perturbation orientation: It is the relative orientation between the human upper body and the direction of the force vector applied to the subject. It is measured by means of the IMUs of the smart garment and the perturbator, and it is expressed as pitch, yaw, and roll components (see reference frames in Figure 6). Within the BenchBalance protocol, we consider acceptable perturbations applied perpendicular to the human body with a tolerance of ±30 degrees in pitch and yaw angles for all conditions. • Perturbation location: It is the area of the subject's upper body where the force is applied. This location is extracted by checking the number of the four most active sensors in the smart garment, identifying them based on the zones reported in Figure 3.
• Exoskeleton: It is a Boolean value indicating the presence or not of the exoskeleton in the experiment (0: user without exoskeleton; 1: user with exoskeleton). If an exoskeleton is present, its characteristics (dimensions and mass distribution) are considered for BIs calculation, as described in the following section. Figure 5. Example of mean and SD of five suitable perturbations provided for Low (green) and High (blue) magnitudes. The perturbation duration is defined as the interval in which the force is exceeding the detection threshold, which is set to 5 N (dashed line). Table 1. Location and type of the perturbations within the pre-defined protocol. The selection of sensors is based on the zones defined in Figure 3. M stands for the total mass of the participant (+ exoskeleton if present). Five good repetitions of the same condition are required to be recorded by the user interface. If during the execution of the protocol, any perturbation occurs that does not comply with the previous CVs, the acquisition interface generates a flag indicating in real time to the experimenter that the provided perturbation was not correct, and thereby, it needs to be repeated. This flag is also stored in the saved data for the post-processing.
These BIs are derived from the kinematic data of the combined (human + exoskeleton) Center of Mass (CoM), which was calculated based on a simplified 2D model defined for both sagittal and frontal planes (Figure 7). This type of model is a well-known approach widely used in the literature for the analysis of postural reactions during stance [4,7,20,[23][24][25]. In our case, it is composed of a 5-segment rigid body system: bilateral shank segments, bilateral thigh segments, and HAT (head, arms, and trunk). The inter-segment connections were considered to be pure rotatory joints. The feet were assumed to be flat on the ground; i.e., the ankle joint was virtually connected directly with the ground.
Two parts of the model were defined for the analysis (Figure 7): (1) a sagittal plane model to account for AP perturbations; and (2) a frontal plane model to account for ML perturbations. Some assumptions were considered: • Left and right legs are considered identical and symmetrically placed. • Only ankle, knee, and hip pure flexion/extension angles were considered for the sagittal plane model, and only pure hip add/abduction was used to define the orientation for the frontal plane model. Each joint angle was derived by comparing the relative orientations of the proximal and distal segments around each joint. • Left/right angles were considered equal (left/right average of collected data).
As input parameters for the BenchBalance model, we required the following: (1) the length of the human segments (thigh, shank, and HAT [26]), which can be directly measured on the subject's body; (2) the masses and the CoM position of each segment, which are computed from the total subject mass by using approximations from standard anthro-pometric data [27]; and (3) the mass properties and the CoM positions of the single links of the exoskeleton, if present, to be combined with those of the human. The CoM of the human (CoM H ) and the CoM of the exoskeleton (CoM E , when the exoskeleton was present) were calculated separately by using the defined model with the assumption that human segments and exoskeletons links had exactly the same length, and hence, human and robotic joints were perfectly aligned. Subsequently, the total (human+exoskeleton) center of mass CoM T was calculated as follows: where M H and M E are the masses of the human and exoskeleton, respectively. Note that if the exoskeleton is not present, CoM T = CoM H . The computation of the CoM motion performed with the BenchBalance sagittal/frontal 2D model was validated by comparing it with a reference estimation computed by means of a commercial IMU-based motion capture system and its biomechanical software (Xsens MVN, Enschede, The Netherlands). This reference measurement system uses a full human tracking composed by 17 IMUs attached to different body segments, which estimate the 3D position and orientation of these segments. The positions and orientations, together with a body mass distribution model, result in the calculation of the total body CoM [28]. Representative data demonstrating the consistency between the BenchBalance model and the calculation from the full-body 3D mocap system model are reported in Appendix B.
For every perturbation that complied with the above-mentioned CVs, we used the CoM T to calculate the corresponding BIs. In the calculation of BIs, the model of Bench-Balance only uses joint angle data besides the anthropometric measures of the subject. Hence, even if the model is simplified, it is extremely versatile, since it can be applied to several experimental conditions in which joint angles can be measured in different ways, e.g., by IMU-based sensors, optoelectronic cameras, or angular sensors embedded in exoskeleton joints.
The two BIs of BenchBalanceare are calculated as follows: • Body sway (q sway ): It represents the maximum body angle in response to a perturbation. A high value of the q sway indicates less ability of the subject in maintaining balance. This BI was evaluated separately for AP and ML (Figure 7a,b) ). (2) • Recovery time: It stands for the time spent to recover from a perturbation, i.e., the time needed to get the CoM back to the rest position within a certain error tolerance.
Here, we consider that the position is being recovered if the sway velocity is lower than a threshold of 0.86 deg/s for at least 0.5 s after the perturbation onset.

Proof of Principle
To provide an example of the functional capabilities, data analysis, and derivation of metrics of the BenchBalance system, a proof of principle is reported (see movie on Video S1 and the system setup in Appendix C).
A healthy female participant (age 31 years old, weight 60 kg, height 1.76 m) was selected as the test subject. The BenchBalance system was tested in two scenarios: (1) participant without exoskeleton; and (2) participant wearing the H2 exoskeleton with hip, knee, and ankle actuators in passive mode (i.e., not active) [29]. In both scenarios, the participant wore the smart garment of BenchBalance and a Xsens full body motion capture suit (Xsens, Enschede, The Netherlands). The latter was used to record full body kinematics [28] and provide the joint angles that were later employed to extract the CoM motion, the sway angle (q sway ), and its velocity by means of the BenchBalance model (Figure 7). The Xsens suit was calibrated separately for each scenario, and the recorded data were synchronized with the BenchBalance data by means of a trigger signal received by both systems.
During the experiment, the participant was asked to stand with feet shoulder width apart and informed that she would receive perturbations with different intensities and locations (following the protocol presented in Table 1). The task of the participant was to counteract these perturbations without taking a step or lifting the heels or the toes.
The perturbations were manually delivered by an experimenter by means of the Bench-Balance perturbator. Consecutive perturbations were separated by at least three seconds to guarantee that the participant had recovered the initial standing position between perturbations.
The software acquisition interface of BenchBalance provided visual and audio feedback to inform the experimenter whether a perturbation was acceptable, i.e., within the selected tolerance ranges defined in Section 2.2.2, or needed to be repeated. Five acceptable repetitions of the same desired perturbation were required to be collected.

Performance of the BenchBalance System
The experimenter was able to familiarize with the BenchBalance system and could provide the perturbations defined in the protocol within 20 min. The system was accurate in assessing the magnitude, duration, orientation, and location of the perturbations and in giving feedback to the experimenter to keep the perturbations within the pre-defined tolerance ranges. Figure 8 demonstrates an example of how the system checked whether the perturbations were applied correctly. If during the execution of a specific condition of Table 1, the provided perturbation did not comply with the pre-defined tolerance ranges of magnitude, duration, location, or orientation, the system detected it (vertical gray bars in Figure 8) and provided the corresponding audio and visual feedback to the experimenter.
Once (and only if) five adequate perturbations were provided within the same condition, the acquisition interface of BenchBalance indicated to the experimenter to move to the next condition of the protocol. The pre-defined tolerance ranges for the magnitude, location, and orientation are represented by the colored areas: (a) Perturbator-Low perturbations area in green, and High perturbations area in blue; (b) Smart garment-left torso area in orange, and left shoulder area in red; (c) Yaw angle area in blue, and pitch angle area in red. On the (b) Smart garment subplot, S1 to S4 refer to the four most active sensors registered at a time. At least one active sensor must be within the targeted area of the garment to comply with the location of the perturbation. Vertical gray bars represent the flag provided by the BenchBalance system to identify the discarded perturbations due to non-compliance with any of the tolerance ranges of magnitude, duration, location, or orientation.

Participant and Exoskeleton Performances
The participant's balance reactions to the provided perturbations were assessed by the BIs of BenchBalance by using the simplified models presented in Figure 7. Repetition averages and standard deviations (SD) of reactions were computed by using the five adequately delivered perturbations for each condition of the protocol. The BIs of BenchBalance were able to detect and report differences in balance performance depending on the perturbation magnitude and the presence or not of the exoskeleton (Figure 9). In the majority of the conditions, both the q sway and the recovery time were larger with the High perturbation magnitude than with the Low one (see Figure 10 and Table A1 reported in Appendix A). Additionally, when the participant wore the exoskeleton, balance reactions involved larger joint excursions than when the exoskeleton was not present. In general, the reactions with the exoskeleton included hip and ankle balance strategies (stick diagrams in Figure 9), normally resulting in larger q sway and recovery times.
The response of q sway with and without the exoskeleton had the same pattern but with a different amplitude for all perturbation conditions except for the perturbations from the back (Figures 9 and 10). For the perturbations from the back, the response of the participant was dominated by a different balance strategy when she was wearing or not wearing the exoskeleton. When not wearing the exoskeleton, the total CoM T = CoM H (and thereby the q sway ) described a forward movement in AP direction due to the flexion of the trunk. However, when the participant wore the the exoskeleton, despite the flexion of the trunk, the combined CoM T did not generate a large positive value in AP, which might be because of two reasons: (1) the CoM E was behind the subject; (2) the participant also bent the knees to counteract the perturbation. This divergence in balance responses with/without the exoskeleton was more evident for High than for Low perturbation magnitude.

Discussion
An objective assessment of postural and balance control of exoskeletons and their users is indispensable to create a solid and unified benchmarking of wearable robots [11,30]. In the present contribution, we described in detail the hardware and software of the Bench-Balance system, which is a novel solution to assess and compare the balance capabilities of users wearing or not an exoskeleton.
To assess and compare these balance capabilities, the definition of performance metrics was required. A straightforward metric might be to account for the ability to tolerate the provided external disturbances, i.e., the maximum perturbation magnitude that the user + exoskeleton are able to successfully stand can be registered as a score. However, it is not safe or recommendable to "push until failure". Instead, we agreed upon the definition of other performance metrics that provided an interpretation of the balance capability in terms of the reaction to moderate perturbations. In that sense, BenchBalance projects includes two BIs: the evoked body CoM sway and the time needed to recover from the perturbation [20,23].
One of the main advantages of the BenchBalance system is that it is a low-cost and portable solution that gives a high versatility to the experimenter compared to other existing approaches, which consist of robotic devices that are complex and stationary, requiring structured experimental environments [17][18][19]. Its use is not restricted to even-ground laboratory conditions, which opens a wide range of options to assess balance in unstructured and challenging environments. Additionally, the use of BenchBalance does not require a fixation of the device to the human, as previous approaches do. This means that our system provides freedom to apply perturbations at different locations of the human body and in different directions.
Due to the fact that the perturbator of BenchBalance is controlled manually, it might require practice to become proficient in using it in order to keep the controlled variables within the tolerance ranges. To tackle this potential issue, we created a user-friendly acquisition interface, which instructed and gave feedback to the experimenter, making it easier to provide the pre-defined perturbations of the protocol. By using this interface, the experimenter of the test present study could learn to operate the system in less than 10 min, being able to finish the full protocol of perturbations performed in the present study within 20 min.
In the conducted experiment, we focused on a two-dimensional static balance assessment while keeping the feet in place and flat to the ground. This approach has been extensively used in the literature by other groups using similar simplified models of inverted single, double, or triple pendulum [4,7,20,[23][24][25]. Future extensions of the model used in BenchBalance will include both the measurement of the foot angles and the consideration of balance control as a three-dimensional problem. These advances, together with the portability of the system, will allow us to extend the calculation of the BIs to broader scenarios as the assessment of balance during locomotion or in more challenging conditions [31].
A possible shortcoming of (only) using the BIs proposed in BenchBalance might be the impossibility of differentiating between distinct balance strategies (e.g., hip or ankle strategies) [7]. As it was presented in the results of this manuscript, different courses in q sway are often associated with the use of distinct balance strategies, but the metric q sway does not include by itself information about the type of joint used to keep in balance. Moreover, the q sway and the recovery time do not quantify the human-likeness of the balance response. This evaluation of human-likeness might be interesting for future studies, since a wearable exoskeleton that behaves as a human could be perceived as more transparent and reliable by the user [4,30].

Conclusions
In this contribution, we presented the BenchBalance system, which is a portable benchmarking solution proposed to conduct reproducible balance assessments of wearable exoskeletons and their users. The two key elements of the system (a hand-held perturbator and a smart garment) have been described in detail, as well as the developed software interface and the algorithms used to perform the balance assessment.
A proof of principle of the BenchBalance system was carried out with a single participant in two case scenarios: participant without exoskeleton and participant wearing a lower-limb exoskeleton. It provided an indication of the functional capabilities of the system and the potential of the derived balance metrics.

Future Work
The BenchBalance system has been developed for the EUROBENCH consortium and will be incorporated as a testbed at the EUROBENCH facility. The purpose of EUROBENCH is to test the performance of wearable exoskeletons at any stage of development and on a large scale. This will generate future large datasets of the assessment and comparison of balance capabilities of different wearable exoskeletons and/or their implemented controllers. Funding: This research was funded by the European project EUROBENCH2020 (eurobench2020.eu), grant number n779963, BenchBalance.
Institutional Review Board Statement: Ethical approval was waived for this study because it only involved a proof of principle to test the functional capabilities of the system. It is classified as a low risk functional test with a single healthy participant. No sensitive personal data was collected.
Informed Consent Statement: Informed consent was obtained from the participant involved in the proof of principle.

Data Availability Statement: Not applicable.
Acknowledgments: The authors would like to acknowledge the contribution of Pablo Romero, who obtained the BenchBalance system integrated as a node of the Robot Operating System (ROS) at the EUROBENCH facility. This work benefited from work/discussions in the COST Action Wearable Robots CA16116, and the project Discover2Walk under Grant PID2019-105110RB-C31 by the Spanish Ministry of Science and Innovation.

Conflicts of Interest:
The authors declare no conflict of interest.

Appendix A
In this appendix, we present the results of the BIs calculated for each of the sixteen conditions of the protocol and the two scenarios tested (participant without and with the exoskeleton).

Appendix B
The calculation of the CoM performed with the BenchBalance model was compared with a reference estimation retrieved from a commercial Motion capture system and its 3D full biomechanical model (Xsens MVN [28]). In the table below, we show the mean ± SD across all the frontal and sagittal conditions of the protocol for the linear correlation coefficient (Pearson's correlation coefficient, r) and the root mean square error (RMSE) between the sway angle computed by both systems. To compute the mean ± SD for both planes, we used the data of this manuscript (including all sixteen protocol conditions) that referred to the scenario of the participant not wearing the exoskeleton. The RMSEs were calculated for the time interval 0.5 s before and 5 s after the perturbation onsets.

Appendix C
In this appendix, we show the system setup during a test with a human wearing an exoskeleton at the EUROBENCH facilities ( Figure A1). See also Video S1 in Supplementary Materials for a real demonstration of the use of the system. Figure A1. Setup for the use of BenchBalance together with a lower-limb exoskeleton and an IMUbased Mocap system.